Synthetic Humans for Action Recognition from Unseen Viewpoints
نویسندگان
چکیده
Abstract Although synthetic training data has been shown to be beneficial for tasks such as human pose estimation, its use RGB action recognition is relatively unexplored. Our goal in this work answer the question whether humans can improve performance of , with a particular focus on generalization unseen viewpoints. We make recent advances monocular 3D body reconstruction from real sequences automatically render videos labels. following contributions: (1) we investigate extent variations and augmentations that are improving at new consider changes shape clothing individuals, well more relevant non-uniform frame sampling, interpolating between motion individuals performing same action; (2) introduce generation methodology, SURREACT allows spatio-temporal CNNs classification; (3) substantially state-of-the-art NTU RGB+D UESTC standard multi-view benchmarks; Finally, (4) extend augmentation approach in-the-wild subset Kinetics dataset case when only one-shot available, demonstrate improvements well.
منابع مشابه
Towards Universal Representation for Unseen Action Recognition
Unseen Action Recognition (UAR) aims to recognise novel action categories without training examples. While previous methods focus on inner-dataset seen/unseen splits, this paper proposes a pipeline using a large-scale training source to achieve a Universal Representation (UR) that can generalise to a more realistic Cross-Dataset UAR (CDUAR) scenario. We first address UAR as a Generalised Multip...
متن کاملDynamic visual information facilitates object recognition from novel viewpoints.
Normally, people have difficulties recognizing objects from novel as compared to learned views, resulting in increased reaction times and errors. Recent studies showed, however, that this "view-dependency" can be reduced or even completely eliminated when novel views result from observer's movements instead of object movements. This observer movement benefit was previously attributed to extra-r...
متن کاملExtrinsic cues aid shape recognition from novel viewpoints.
It has been shown previously that the visual recognition of shape is susceptible to the mismatch between the retinal input and its representation in long-term memory, especially when this mismatch arises from rotations in depth. One possibility is that the visual recognition system deals with such mismatch by a transformation of the input or the representation thereby bringing both into alignme...
متن کاملAction Recognition in Semi-synthetic Images using Motion Primitives
This technical report describes an action recognition approach based on motion primitives. A few characteristic time instances are found in a sequence containing an action and the action is classified from these instances. The characteristic instances are defined solely on the human motion, hence motion primitives. The motion primitives are extracted by double difference images and represented ...
متن کاملAdaptation of Pronunciation Dictionaries for Recognition of Unseen Languages
This paper studies the relative effectiveness of different methods for multilingual model combination and dictionary mapping for recognizing a new unseen target language if training data are limited. We examine the crosslanguage transfer from monolingual and multilingual models to German and Russian language for large vocabulary speech recognition using a dictation database which has been colle...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2021
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-021-01467-7